TAGs: REINFORCE algorithm